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Abstract 

The problem of designing policies for in-network function computation with minimum energy 
consumption subject to a latency constraint is considered. The scaling behavior of the energy 
consumption under the latency constraint is analyzed for random networks, where the nodes are 
uniformly placed in growing regions and the number of nodes goes to infinity. The special case of 
sum function computation and its delivery to a designated root node is considered first. A policy 
which achieves order-optimal average energy consumption in random networks subject to the 
given latency constraint is proposed. The scaling behavior of the optimal energy consumption 
depends on the path-loss exponent of wireless transmissions and the dimension of the Euclidean 
region where the nodes are placed. The policy is then extended to computation of a general 
class of functions which decompose according to maximal cliques of a proximity graph such as 
\ the fc-nearest neighbor graph or the geometric random graph. The modified policy achieves 

qq ■ order-optimal energy consumption albeit for a limited range of latency constraints. 

in '. 

QO . Keywords: Function computation, latency-energy tradeoff, Euclidean random graphs, minimum 

broadcast problem. 

1 Introduction 



O 

(N 

C 

oo 



A host of emerging networks are pushing the boundaries of scale and complexity. Data centers are 
being designed to distribute computation over thousands of machines. Sensor networks are being 
5h ' deployed in larger sizes for a variety of environmental monitoring tasks. These emerging networks 
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face numerous challenges and the threat of a "data deluge" is an important one. The data collected 
by these networks typically scale rapidly as their size grows. Routing all the raw data generated in 
these large networks is thus not feasible and has poor scaling of resource requirements. 

In this paper, we consider the scenario where only a function of the collected raw data is required 
at some specific node in the network. Many network applications fall into this category. For instance, 
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in a statistical inference application, where a decision has to be made based on the collected data, 
the likelihood function suffices to make the optimal decision [T] . Such functions can have significantly 
lower dimensions than the raw data and can thus considerably reduce the resource requirements for 
routing. 

In this paper, we analyze the scaling behavior of energy and latency for routing and computation 
of functions in random networks, where the nodes are placed uniformly in growing regions and the 
number of nodes goes to infinity. In particular, we address the following questions: how can we 
exploit the structure of the function to reduce energy consumption and latency? What class of 
functions can be computed efficiently with favorable scaling of energy and latency requirements? 
How do the network properties such as the signal propagation model affect the scaling behavior? 
What is the complexity for finding the optimal policy with minimum energy consumption under 
a given latency constraint for function computation? Are there simple and efficient policies which 
achieve order optimal energy consumption? The answers to these questions provide important 
insights towards engineering in- network computation in large networks. 

1.1 Summary of Contributions 

The contributions of this paper are three-fold. First, we propose policies with efficient energy 
consumption which compute any function belonging to a certain structured class subject to a feasible 
latency constraint. Second, we prove order-optimality of the proposed policies in random networks. 
Third, we derive scaling laws for energy consumption in different regimes of latency constraints for 
different network models. To the best of our knowledge, this is the first work to analyze energy- 
latency tradeoff for function computation in large networks. These results provide insight into the 
nature of functions which are favorable for in-network computation. 

We analyze the scaling laws for energy and latency in random networks, where n nodes are placed 
uniformly in a region of volume (or area) n in ]R rf , and we let n — > oo. We consider (single-shot) 
function computation and its delivery to a designated root node. We first consider the class of sum 
functions, which can be computed via an aggregation tree. We characterize the structural properties 
of the minimum latency tree and propose an algorithm to build an energy-efficient minimum latency 
tree based on successive bisection of the region of node placement. However, minimum latency comes 
at the expense of energy consumption and we relax the minimum latency constraint. Our modified 
algorithm achieves order-optimal energy consumption for any given latency constraint. It is based 
on the intuition that long-range communication links lower latency but increase energy consumption 
and the key is to strike a balance between having long-range and short-range communications to 
achieve the optimal tradeoff. 

We then consider the more general class of functions that decompose as a sum of functions over 
the maximal cliques of a proximity graph, such as the /s-nearest neighbor graph or the random 
geometric graph. These functions are relevant in the context of statistical inference of correlated 
measurements which are drawn from a Markov random field. See [1] for details. We extend the 
proposed sum-function policy to this case and prove that it achieves order-optimal energy consump- 
tion (up to logarithmic factors) albeit under a limited range of latency constraints. In this range of 
feasible latency constraints, the energy consumption is of the same order as sum function computa- 
tion. Hence, functions based on locally-defined proximity graphs can be computed efficiently with 
optimal scaling of energy and latency requirements. 

We analyze the scaling behavior of energy consumption under different regimes of latency con- 
straints and for different signal propagation models. We assume that the energy consumed scales 
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as R v where R is the inter-node distance and v is the path-loss exponent and consider nodes placed 
in a region in M. d . We prove that in the regime 1 < v < d, order-optimal energy consumption and 
minimum latency can both be achieved simultaneously. On the other hand, in the regime v > d, 
there is a tradeoff between energy consumption and the resulting latency of computation, and our 
policy achieves order-optimal tradeoff. 

1.2 Prior and Related Work 

There is extensive literature on in-network processing. Some of the earliest arguments for in- network 
processing for scalability are presented in [2j[3]. The work of Giridhar and Kumar [3] provides a 
theoretical framework for in-network computation of certain functions such as sum function and 
analyze scaling of capacity as the network size grows. However, the work in [!j is concerned with 
the rate of information flow when the function is computed an infinite number of times, while 
we consider latency of single-shot function computation. Single-shot computation is relevant in 
applications involving one-time decision making based on a set of measurements. Moreover, we 
consider a richer class of functions which decompose according to some proximity graph. These are 
relevant in statistical inference applications with correlated measurements. 

For the special case of sum-function computation, the minimum latency is the same as that for 
the minimum broadcast problem, where the root has information that needs to be disseminated 
to all the nodes. Most of the previous work on minimum broadcast problem, e.g., [5j[6] , have 
focused on obtaining good approximations for minimum latency in arbitrary networks, but do not 
address the issue of scaling behavior of latency- energy tradeoff in random networks. These works 
also assume that only short-range communication may be feasible for communication. On the 
other hand, we allow for a few long-range links but focus on obtaining favorable scaling of overall 
energy consumption. Works considering latency-energy tradeoff in multihop networks are fewer. 
For instance, the works in [7H9] consider energy-latency tradeoff for data collection but without the 
possibility of in-network computation, which can be significantly more expensive. The work in |10] 
considers latency-energy tradeoff but during the deployment phase of the network. 

With respect to analysis of energy scaling laws in randomly placed networks, the work in 
derives scaling laws for multihop routing without in-network computation. In |12| . the minimum 
energy policy for graph-based function computation is first analyzed in the context of statistical 
inference of correlated measurements and is shown to be NP-hard. An efficient policy is derived 
based on the Steiner-tree approximation. In [T], scaling laws for energy consumption are derived 
for computation of graph-based functions in random networks. When the function decomposes 
according to the cliques of a proximity graph, such as the /c-nearest neighbor graph or the random 
geometric graph, it is shown that the function can be computed with3 O(n) energy consumption in 
random networks, where n is the number of nodes. A simple two-stage computation policy achieves 
this scaling and is shown to have asymptotically a constant approximation ratio, compared to the 
minimum energy policy. In this paper, we extend the work to incorporate latency constraints and 
design policies which minimize energy consumption under the constraints. 

For any two functions f(n),g(n), /(n) = 0(g(n)) if there exists a constant c such that f(n) < cg(n) for all n > no 
for a fixed no £ N. Similarly, /(n) = f2(<?(n)) if there exists a constant c such that f(n) > c'g(n) for all n > no for a 
fixed n £ N, and /(n) = Q(g(n)) if f(n) = £l(g(n)) and /(n) = 0(g(n)). 
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2 System Model 



2.1 Communication and Propagation Model 

In a wireless sensor network, there are communication and energy constraints. We assume that 
any node cannot transmit and receive at the same time (half duplex nodes). We assume that 
a node cannot receive from more than one transmitter at the same time and similarly, a node 
cannot transmit simultaneously to more than one receiver. We assume that no other interference 
constraints are present. This is valid if nearby nodes transmit in orthogonal channels or when they 
have idealized directional antenna which can focus the transmissions within a narrow region around 
the receiver (e.g., |13[I14|). We also assume that nodes are capable of adjusting their transmission 
power depending on the location of the receiver leading to better energy efficiency. 

We assume unit propagation delays along all the communication links and negligible processing 
delays due to in-network computation at nodes. For a transmission along edge (from node 

i to node j), the energy consumption^] is equal to R\p where Rij is the Euclidean distance and 
typically v G [2,6] for wireless transmissions. In this paper, we allow for any v > 1. 

2.2 Stochastic model of sensor locations 

Let Q n C M. d denote the d-dimensional hypercube [0, n 1 ' d ] d of volume n, and typically d = 2 or 3 
for sensors placed in an Euclidean region. We assume that n sensor nodes (including the root) are 
placed uniformly in Q n with sensor i located at Vi G We denote the set of locations of the n 
sensors by V n :={Vi, . . . , V n }. For our scaling law analysis, we let the number of sensors n — > oo. 
Denote the root node by r, where the computed function needs to be delivered, and its location by 
V r . 

2.3 Function Computation Model 

Each sensor node i collects a measurement Y{ G y, where y is a finite set, and let Y n = {Yi, . . . , Y n } 
be the set of measurements of n nodes. We assume that the goal of data aggregation is to ensure 
that a certain deterministic functional ^ : (Y n , V n ) i— > R is computable at the root r at the end of 
the aggregation process. The set of valid aggregation policies ir is thus given by 

3(V n ; ^):={tt : *(Y n , V„) computable at r}. (1) 

Using the propagation model discussed in Section 12. 1| the total energy consumption of the aggre- 
gation process under a policy ir G 5(V n ; ^) is 

S«(V n ) := R ei 

where is the set of links used for inter-node communication by the policy 
function computation is 

L n (V n ; $):=inf[£ : *(Y n , V n ) computable at V x at time t], (3) 

2 Since nodes only communicate a finite number of bits, we use energy instead of power as the cost measure. 
3 In general, the function can depend on the locations where the measurements are collected. 
4 We consider one-shot function computation. 



(2) 
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Figure 1: Example of a function dependency graph 9 and the function decomposes in terms of the 
maximal cliques of the graph, as represented by dotted lines. 

where the aggregation process starts at t = 0. Let L*(V„; ^) be the minimum latency over the set 
of valid policies. 

If no further assumptions are made on the function ^f, then all the measurements Y n need to 
be delivered to the root without any in-network computation. This is expensive both in terms of 
latency and energy consumption. Typically, the function ^ decomposes into sub-functions involving 
only subset of measurements. In this case, in-network computation can be carried out to enable 
efficient tradeoff between energy consumption and latency of computation. We assume that the 
function ^ has the form, 

*(V n ,Y n ) = ^VU(^)to), (4) 
cee 

where C is the set of maximal clique^ on some graph Si<- See Fig{T]for an example. Note that this 
graph is related to the function ^ and not with communication links. We refer to as the 
function dependency graph. 

We consider the case when the grapfjf] 9 is either a /c-nearest neighbor graph (fc-NNG) or the 
p-random geometric graph (p-RGG) with threshold radius p, where k, p are some fixed constants, 
independent of the number of nodes n. These graphs are relevant choices since many functions are 
based on proximity of the nodes. For instance, in the context of statistical inference, this corresponds 
to node measurements being locally dependent according to a Markov random field with the given 
graph S(V n ). See [1] for details. 

2.4 Energy-Latency Tradeoff 

Denote the minimum latency for function computation over the set of valid policies by L*, i.e., 

L*(V n ;S*):=inin J L 7r (V n ;g*). (5) 

The policy achieving minimum latency L* can have large energy consumption and similarly, policies 
with low energy consumption can result in large latency. Hence, it is desirable to have policies that 

A clique is a complete subgraph and is maximal if it is not contained in a bigger clique. 
6 In fact, our results hold for a general class of graphs satisfying a certain stabilization property. See |15| for details 
and examples. 
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Figure 2: The latency for an aggregation tree can be obtained iteratively by considering subtrees. 
See Proposition [TJ 

can tradeoff between energy consumption and the latency of function computation. We consider 
finding a policy with minimum energy consumption subject to a latency constraint, 

£*(V„;<5,3*) :=min^(V„ ; g*), s.t. L w < L* + 5, (6) 

where 5 (which can be a function of n) is the additional latency suffered in order to reduce energy 
consumption. In general, finding §§§ is NP-hard for nodes placed at arbitrary locations (since the 
special case of this problem of finding minimum energy policy with no latency constraints is NP- 
hard [16]). We instead propose a policy which has energy consumption of the same order as the 
optimal policy for randomly placed nodes V n , as n — > oo, and for any given latency constraint. 

3 Sum Function Computation 

A sub-class of functions in (|15p is the set of sum functions 

n 

tt(V„,Y n ) (7) 

i=l 

which have the maximum extent of decomposition over the set of nodes. Computing sum functions 
is required in various network applications, e.g., to find the average value, in distributed statistical 
inference with statistically independent measurements |T], and so on. 

3.1 Preliminaries 

We first discuss the policy to achieve minimum latency L*(V n ; in ([5]) for sum function compu- 
tation without considering the energy consumption. In this case, the minimum latency does not 
depend on the position of the nodes V n but only on the order of scheduling of the various nodes, i.e., 
L*(V n ;^) = L*{n). Moreover, the minimum latency L*(n) can be achieved via data aggregation 
along a spanning tree T*(n), directed towards root r. 

For data aggregation along any directed spanning tree T, each node waits to receive data from 
its children (via incoming links), computes the sum of the values (along with its own measurement) 
and then forwards the resulting value along the outgoing link. See Figj3]for an example. Let Lt be 
the resulting latency along tree T. We now make a simple observation. See also Figj2] 
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Figure 3: The min. latency tree T* with edge level labels. See Alg{TJ 

Proposition 1 (Latency along a tree) For a spanning tree T with root r, the latency Lt is given 
by 

L T = max {i + LTi}, (8) 

i=l,...,k 

where Ti is the subtree rooted at node i, and 1, . . . , k are the children C(r;T) of the root node r 
ordered such that Lt ± > Lt 2 • • • > Lx k • 

Proof: Indeed, after time Lr i , information from 1, . . . ,i has still not been sent to the root, and 
this will take at least time i, so Lj- > i + for alH = 1, . . . , k. Conversely, there is a simple policy 
with latency in (JHJ) which aggregates along the subtrees with latency and then node i sends 
its data to the root r at time slot Lt —i. □ 
Using ([8]) we can thus effectively compute the latency of any given rooted tree T. We now 
provide the result on the minimum latency L*{n) and the construction of the tree T*(n) achieving 
it. This has been previously analyzed in the context of minimum broadcast problem [5], where the 
root has information that needs to be disseminated to all the nodes. 

Lemma 1 (Minimum Latency Tree) The minimum latency for sum function computation over 
n nodes is L* (n) = |~log 2 n] . Equivalently, the maximum number of vertices in a tree with latency 
at most L is 2 L . 

Proof: See Appendix [Aj □ 
There is a unique minimum latency tre^ll T*(n) up to a permutation on the nodes. The minimum 
latency tree can be constructed recursively as explained in Algorithm [TJ The algorithm runs for 
L*{n) steps and in each step, a child is added to each node already in the tree. An example of the 
minimum latency tree is shown in Figj3] 

3.2 Policies for Energy Latency Tradeoff 

We now propose a policy for sum function computation with order-optimal energy consumption 
subject to a given latency constraint in ([6]). Note that the minimum latency tree T*{n) does not 
depend on the node locations and any permutation of the nodes on the tree (with the root fixed) 
results in the same latency. On the other hand, the energy consumption depends on the node 
locations. We propose an energy-efficient minimum-latency tree T*(V n ) in Algorithm [2J depending 
on the node locations. This will be later proven to achieve order-optimal energy consumption for 
uniformly placed nodes. We first note some definitions used in the sequel. 

7 Note that the balanced binary tree on n nodes has latency 2[log 2 (n + 1)] — 2, which is about twice L*(n). 



7 



Algorithm 1 Min. latency tree T*(n). 



Input: nodes N = {1, . . . , n}, root node r. C(i; T) denotes children of node i. S(k; T) denotes level 

k edges in T. For any set A, let A {r} denote A <— A U {r}. 
Output: T*(n). 

1: Initialize set A = {r} and T* = {r}. 

2: for k = 1, . . . , [~log 2 n] do 

3: B <- A. 

4: for each i G 5 do 

5: if iV \ A ^ then 

6: For some j € N \ A, C(i; T*) <— j (j is now a child of i), S(k; T*) (level k edges) 



and A <— j. 




Figure 4: The min. latency tree T*(V n ) over 15 nodes with edge level labels placed in square region. 
See AlgJ2 



Definitions: For a rooted tree T and a node i, let C(i; T) denote the children of i. The level l{e;T) of 
a link e in an aggregation tree T is given by Lt~ t e , where t e is the time at which data is transmitted 
along link e (time is the start of the aggregation process). Note that the level depends on both 
the tree structure and the transmission schedule on the tree. Let 

S(k; T) := {e : l(e; T) = k,e£ T}. (9) 

be the set of level k edges in tree T. See FigEJ Let SP; (i,j;V n ) denote the least-energy patlH 
between i and j with at most I > intermediate nodes when the node locations are V n . For a 
rectangular region Q C M. d containing a subset of nodes and a reference node i such that Vi £ Q, 
let Bi(Q;i),B2(Q;i) be the two halves when bisected along the coordinate with the largest extent 
such that Bi(Q;i) and B2(Q',i) have equal number of nodes with V{ £ B\{Q;i). 

We propose an energy-efficient minimum-latency tree T*(V n ) in Algorithm [2] and, prove in 
Section 13.31 that T*(V n ) achieves order-optimal energy consumption for uniformly placed nodes 
subject to the minimum latency constraint. In Algorithm [21 every added node in the tree picks 
a new child, as in Algorithm [U but now the children are chosen based on the node locations. 
Specifically, in the first iteration, the region of node placement is bisected (with equal number of 

8 Note that the least-energy path depends on the path-loss exponent v and for larger v, multi-hop routing is more 
energy efficient than direct transmissions. 
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Algorithm 2 Min. lat. tree T*(V n ) with order opt. energy 

Input: Locations of nodes: V n = {V±, . . . ,V n }, root node r, Q n C M. d : region where the nodes 
are placed. C(i;T) denotes children of node i. S(k;T) denotes level k edges in T. For a 
rectangular region Q and a node % with V% € Q, let Bi(Q; i) and B 2 {Q] i) be the two halves with 
Vi € Bi(Q; v). For any set A, let A ¥- {r} denote A <- A U {r}. 
Output: T*(V n ). 

1: Initialize A <— {r}. R r ^— Q n . 

2: for k = 1, . . . , |~log 2 n] do 

3: B ^ A. 

4: for each i £ B do 

5: if V n nB 2 (^;j) 7^0 then 

6: For some node j s.t. V,- G B 2 (Rf,i), C(i;T*) j, S(k;T*) A ^ {j}, <- 

B 2 (Ri;i) and ^ <- 



nodes in each half), and the root chooses a child in the other half. The region assigned to the root 
is now the half-region (where it is located), while the added child node is assigned the other half- 
region. The subsequent iterations proceed in a similar manner and each node bisects its assigned 
region into two halves and picks a child in the other half, and updates the assigned regions. 

The algorithm [2] considered energy-efficient policy under the minimum latency constraint. We 
now present the policy 7r AGG for any given latency constraint in Algorithm [3j The difference between 
the two cases is that in the latter case, a lower energy consumption is achieved by exploiting 
the relaxed latency constraint. Intuitively, long-range (direct) communication entails more energy 
consumption than multi-hop routing, especially when the path-loss exponent v is large. On the 
other hand, latency is increased due to multihop routing. The key is to carefully convert some of 
the long-range links in T*(V n ) into multi-hop routes to lower the energy consumption and take 
advantage of the additional allowed latency. 

In Algorithm [3j the regions are bisected and new nodes are chosen as children, as in Algorithm [2] 
But instead of directly linking the nodes in the two hops, the least-energy route is chosen with at 
most Wk intermediate routes, where wt is a fixed weight. The nodes that are already added in 
this manner are not considered for addition as children in the subsequent iterations. In general, 
the resulting set of communication links is not a tree, since the least-energy paths constructed in 
different iterations may share the same set of nodes. But the sum function computation can be 
carried out on similar lines, as on an aggregation tree. We now relate the weights (w^) with the 
latency of the resulting policy 7r AGG in Algorithm [H 

Proposition 2 (Latency under Algorithm [3]) The aggregation policy -k agg in Algorithm^ for 
a given set of weights w achieves a latency of 

[log 2 n\-l 

L^ AGG {n)<L*{n)+ w k . 

k=0 

Proof: There are at most [log 2 n\ iterations and the total delay is 

[log 2 n"|-l [log 2 n]-l 

{l + w k )=L*{n)+ w k- 

k=0 k=0 
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Algorithm 3 Latency-energy tradeoff policy 7r AGG (V n ; w). 

Input: Locations of nodes: V n = {Vi, . . . ,V n }, root node r, and set of weights Wk for k = 
0,..., |~log 2 n\ — 1. For a rectangular region Q and node v E Q, let B\{Q;v) and B 2 (Q;v) 
be the two halves with v € B\(Q;v). Let SP;(i,j; V re ) be /-hop least-energy path. Q„ C M. d : 
region where the nodes are placed. For any set A, let A <— {r} denote A A U {r}. 
Output: G n : communication links used by policy 7r AGG . 
1: Initialize Ai,A 2 ^— {r}. R r ^— Q n . 
2: for k = 0, . . . , |~log 2 n] — 1 do 
3: B <- Ai 
4: for each iGBdo 
5: if (V n nB 2 {R l ;i))\A 2 /0then 
6: Pick j s.t. E 62(^5 i) \ A 2 , Ai {j}, 

G7r AGG ^ gp wfc(t . ji; Vn)) ^ u. SP^ fc (i, j; V n ), 
i?j «- B 2 {Ri;i) and i?j «- Bi(-R,;i). 



□ 

Thus, the weights (wf.) can be chosen to satisfy any given latency constraint and we have a 
policy 7r AGG for sum function computation given any feasible latency constraint. The analysis of 
energy consumption under 7r AGG for a given set of weights is not straightforward to analyze and 
forms the main result of this paper. This is discussed in the next section. 

3.3 Order-Optimality Guarantees 

To achieve optimal energy-latency tradeoff according to ([6]) , we choose weights Wk in Algorithm [3j 
for k = 0, ... , |~log 2 n] — 1, as 

' LC^2 fc(1/l/ ~ 1/d) J if v>d, 

_ o.w. (10) 

where 5 is the additional latency allowed in ©, v is the path- loss factor for energy consumption in 
d2|) and d is the dimension of Euclidean space where the nodes are placed. The normalizing constant 
( is chosen as 

f 1 - 2 l ' u ~ l l d , if v > d, 

Jlog 2 n]-\ u = d, (11) 

so that X^I=o 2 ^ 1 Wk — ^- Hence, from Lemma [2j the weights in (|10p result in a policy 7r AGG with 
latency L*(n) + 5. We now provide the scaling behavior of optimal energy consumption as well the 
order-optimality result for 7r AGG . 

Theorem 1 (Energy-Latency Tradeoff) For a given additional latency constraint 5 = 5(n) > 
and fixed path-loss factor v > 1 and dimension d > 1, as the number of nodes n — > oo, the minimum 
energy consumption for sum function computation satisfies 



c 



E(r(V n ;<5))= 



'G(n) v < d, 

0(max{n,n(logn)(l + I ^) 1 -^}) v = d, 
6(max{n,n^(l + (5) 1 - ly }) u > d, 
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Figure 5: Scaling of minimum total energy E* in different regimes of path loss v and dimension d. 
See Theorem [TJ 

where the expectation is over the locations V n of n nodes chosen uniformly at random in [0, n l / d ] d 
and is achieved by the policy tt agg in Algorithm^ for weights given by (|10p . 

Remarks: 

(i) The policy 7r AGG in Algorithm [3] thus achieves order-optimal energy consumption under any 
feasible latency constraint when v ^ d. For the case v = d, we show that E[£*(V n ; 5)] = Q(n) while 
the energy consumption under 7r AGG is the upper bound in Theorem Q] given by O(ralogn), i.e., the 
energy consumption is at most only logarithmically worse than the lower bound. 

(ii) The result of Theorem Q] holds even if the latency constraint is relaxed to an average constraint, 
i.e., 

E[L*(V n )] <L*(n) + 5. 

From Theorem [H the energy consumption has different behaviors in the regimes v < d and 
v > d, as represented in FigJSJ and we discuss this below. 

Case v < d: In this regime, the path-loss factor is low, and hence, long-range transmission does 
not suffer a high penalty over multihop routing. This is also favorable for latency performance and 
hence, in this regime minimum latency of |~log 2 n] can be achieved with G(n) energy consumption. 
Note that the aggregation tree with the minimum energy consumption is the minimum spanning tree 
(MST) and the expected energy consumption for MST under uniform node placement is also @(n). 
Hence, our policy 7r AGG achieves both order-optimal energy and minimum latency simultaneously 
in the regime v < d. 

Case v > d: In this regime, multihop routing is much more favorable over direct transmissions with 
respect to energy consumption while direct transmissions are favorable for low latency. Hence, both 
low energy and low latency cannot be achieved simultaneously in this regime. Our policy 7T AGG 
achieves order-optimal energy consumption subject to a given latency constraint in this regime. 
Note that typically, for sensor networks placed in two-dimensional area (d = 2) with wireless 
transmissions (y S [2,6]), this regime is of interest. 

Comparison with MST: If the minimum spanning tree (MST) is used for aggregation, it results in 
minimum energy consumption which is G(n) under random node placement. However, the expected 
latency of aggregation along the MST is at least the depth of the MST and hence, the latency and 
energy satisfy, 

E[Z7 MST (Vn)] = E[£- MST (V„)] = 8(n). (12) 
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In contrast, under our policy 7r AGG , we can obtain, when v < d, 



E[I7 AGG (V re )] = flog 2 nl, E[^ AGG (V n ,)] = 8(n). (13) 
For the case when v > d, our policy achieves 

E[L* (V n )] = 9(n— ), E[5 W GG (V n )] = 0(n), (14) 

v]_d-\_ 

by setting 5 = n in Theorem [T] Thus, our policy 7r AGG is especially advantageous over using 
the MST when the path loss v is small. Moreover, the policy 7r AGG can be designed based on the 
latency requirements while the MST cannot be easily modified to satisfy them. 



4 General Function Computation 

We now extend the latency-energy tradeoff policy to undertake general function computation. Recall 
that we consider the class of functions of the form 

*(V n ,Y n )= M(Yi)iec), (15) 

cee(v n ) 

where C(V n ) is the set of maximal cliques on a graph S(V n ), known as the function dependency 
graph and the functions ip c are clique functions. We consider the case when the graphU is either a 
£;-nearest neighbor graph (fc-NNG) or the /9-random geometric graph (/9-RGG) with threshold radius 
p, where k,p are some fixed constants, independent of the number of nodes n. 

Note that the function ^ in (1 1 5 [) now depends on the location of the nodes V n which is not 
the case with the sum function. Hence, the latency-energy analysis has to take this into account. 
We propose modifications to the latency-energy tradeoff policy 7r AGG to enable general function 
computation and then prove its order-optimality for latency-energy tradeoff. 



4.1 Preliminaries 

The extent of decomposition of function depends on the sparsity (number of edges) of the function 
dependency graph 9- We first make a simple observation that the energy consumption and latency 
increase with more edges in 9- 

Proposition 3 (Energy consumption and sparsity of 9) The minimum energy in ([6]) required 
to compute functions of the form in (1 1 5 [) under a fixed additional latency constraint 5 > with de- 
pendency graphs 9 and 9' satisfies 

S*(V n ; 5, 9) > £*(V n ; 5, 9'), when 9 D 9'. (16) 

Proof: Let C (resp. C be the set of maximal cliques in 9 (resp. 9')- Since 9' C 9, each new clique 
c G 6 \ 6' replaces a smaller set of cliques d C c,d € C The latency and energy consumption 
for any valid policy in computing the clique function ^ c ((i^)j gc ) is at least that of computing 
Vv(Y c /), since this is a further decomposition of ip c . Hence, the result. □ 

9 In fact, our results hold for a general class of graphs satisfying a certain stabilization property. See |15| for details 
and examples. 
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Algorithm 4 Policy 7r CLQ (V n ; w, C) for general functions. 

Input: Locations of nodes: V n = {V\, . . . , V n }, root node r, C: set of maximal cliques of function 
dependency graph S(V n ). For each c G C, V(c) is the processor (node computing clique function 
ifj c ). For any set A, let A <— {r} denote i<- AU {r}. 

CLQ AGG 

Output: 7r CLQ policy with data forwarding links F n and aggregation links G 77 
1: for each clique c G C do 

2: For node i £ c with smallest label, 7 3 (c) <— z. 
3: For all nodes j G c, j 7^ i, i^ 7r ° LQ (j 5 £). 

CLQ 

4: Let Z e be the color for edge e G F n under proper edge coloring with colors I = 1, 2, . . . A + 1. 
5: for f = to A do 

CLQ 

6: Send measurements using links in F n of color t + 1. 
7: Find sum of clique functions using 7r AGG from Algorithm [3j 



Hence, the ability to obtain efficient scaling of latency and energy consumption for function 
computation depends on the sparsity of the function dependency graph 9- The extreme case of a 
trivial graph (9 = 0) is the sum function, analyzed in the previous section, while the other extreme 
is the complete graph (9 = K n ), where there is no decomposition of the function. In the latter case, 
no in-network computation is possible and all the measurements Y n need to be routed to the root 
via least-energy paths. We have the following scaling in this scenario. 

Proposition 4 (Scaling Under No Computation) The minimum latency and minimum en- 
ergy (with no latency constraint) for computation of a function with dependency graph K n satisfies 

E[S*(V n ;oo,K n )} = 6(n 1+1 / d ), E[L*(V n ;K n )] = U(n). 

The result on minimum energy follows from the scaling behavior of energy for least-energy path 
routing to the root under uniform node placement |17| . The latency of function computation is at 
least n since the root can receive at most one measurement value at each timestep and there is no 
aggregation of the measurements. Hence, we can expect efficient scaling of energy and latency only 
in case of computation of functions with sparse dependency graphs 9- 

Moreover, the energy consumption also depends on the edge lengths of the function dependency 
graph 9- Intuitively, when the graph 9 has local edges, the clique functions ip c can be computed 
locally resulting in low energy consumption. This holds for the proximity graphs such as the A;-NNG 
and the /9-RGG ensure under consideration. We propose policies for latency-energy tradeoff which 
are efficient for such locally-defined dependency graphs. 

4.2 Policy for Latency-Energy Tradeoff 

We now extend the policy 7r AGG in Algorithm [3] for general function computation as a two- 
stage policy 7r CLQ . In the first stage known as the data forwarding stage, the clique functions 
tp c are computed locally within each maximal clique c G C of the graph 9 as follows: a clique 
processor is chosen as a clique member with the smallest label (under arbitrary labeling of nodes) and 
other clique members communicate their measurements to the processor via direct transmissions. 

CLQ 

The transmissions are scheduled as follows: the set of forwarding links F n are assigned colors 
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I = 1,2, . . . , A + 1 under a proper edge coloring. At times t = 0, 1, . . . , A, transmissions along 
links of color t + 1 are scheduled simultaneously. In the second stage, the aggregation policy 7T AGG 
in Algorithm [3] is used for computing the sum of the clique function values at the processors (and 
nodes other than processors do not have their own values for aggregation but participate in the 
process). This is summarized in Algorithm |4j 

We obtain the following result for energy-latency tradeoff^! for general function computation. 
Let A (3) denote the maximum degree of the function dependency graph 3 in (|15p . which is either 
the /c-NNG or the p-RGG, where k and p are fixed constants. 

Theorem 2 (Energy-Latency Tradeoff) For a given additional latency constraint 5 > A(S) + 1 
in (|6|) , the energy consumption for function computation of the form (|15p with dependency graph 3 
under the two-stage policy tt clq satisfies 

E(£* CLQ (V n] 5, 3)) = 6(E(£ *(V n ; 5 - (A + 1), 0))), 

where the expectation is over the locations V n of n nodes chosen uniformly at random in [0,n 1 / <i ] d 
and the right-hand side is the minimum energy consumption for sum function computation under 
latency constraint of 5 — (A + 1) which is given by Theorem[l\ 

Remarks: 

(i) The policy 7r CLQ achieves order-optimal energy consumption for cases v < d, 5 > A + 1 and 
v > d, 5 3> A. This is because the minimum energy consumption £*(V n ; 6, 3) is lower bounded by 
the minimum energy for sum function computation from Proposition [3j Theorem [T] provides the 
scaling for minimum energy for sum function computation. Comparing it with the energy under 
7r CLQ policy in Theorem [2] we note that they are both G(n) when v < d. For the case v > d, they 
are still of the same order if the maximum degree A (3) is small compared to additional latency 
constraint 5. 

(ii) The maximum degrees of fc-NNG and />RGG satisfy 

lno" 77 

A(/c-NNG) = (c d + l)k, A(p-RGG) = ), (17) 

log log n 

where a is a constant (depending only on d). See \18\ Cor. 3.2.3] and |19l Thm. 6.10]. Hence, 
for these graphs, 7r CLQ policy is order-optimal (up to logarithmic factors) for any path-loss factor 
v 7^ d and under any additional latency constraint 5>A(S) + 1- The above discussion also implies 
that the minimum energy for sum function computation and general function computation are of 
the same order for fc-NNG and p-RGG dependency graphs. Hence, these functions are amenable to 
efficient latency-energy tradeoff. 

(iii) The policy 7r CLQ can achieve a latency of |~log 2 n \ + A (3) + 1- Finding the policy with minimum 
latency L* in ([5]) for general function computation is NP-hard. However, we have L* > [log 2 n], 
since the minimum latency cannot be smaller than that required for sum function computation. 
We ensure that an additional latency constraint of 5 is satisfied in ([6]) by relaxing the constraint as 
L < |~log 2 n\ + 5. Since 7r CLQ can only achieve latencies greater than [~log 2 ra] + A(3) + 1, we can 
only ensure that constraints 5 > A(S) + 1 are met. 



°The latency constraint V < L* + 5 is required a.s. over the realization of points V,, 
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5 Conclusion 



In this paper, we considered energy-latency tradeoff for function computation in random networks. 
While designing optimal tradeoff policies in arbitrary networks is intractable, we proposed simple 
and easily implementable policies which have order-optimal performance and are relevant in large 
networks. We analyzed the scaling behavior of energy consumption under a latency constraint for 
computation and showed that it depends crucially on the path-loss exponent of signal propagation, 
the dimension of the Euclidean region where the nodes are placed and the extent to which the 
function is decomposable. For functions which decompose according to cliques of a proximity graph 
such as the k nearest-neighbor graph or the random geometric graph, efficient tradeoff can be 
achieved and the energy and latency having optimal scaling behaviors. 

This work opens up an array of important and challenging questions which warrant further 
investigation. While, we considered exact computation of a deterministic function, we expect that 
relaxing these assumptions will lead to a significant improvement of energy and latency scaling. 
We assumed single-shot data aggregation. Extensions to the setting of continuous monitoring and 
collection, where block coding is possible is of interest. We considered a single root node as the 
destination for the computed function, while in reality different nodes may require different functions 
to be computed. An extreme case of this scenario is the belief propagation (BP) algorithm which 
requires computation of maximum a posteriori (MAP) estimate at each node based on all the 
measurements, which are drawn from a Markov random field. Considering scenarios between these 
extremes and designing efficient schemes for energy-latency tradeoff is extremely relevant to many 
network applications. 

A Proof of Lemma Q] 

We prove by induction on L that the maximum number of vertices in a tree of latency at most L 
is exactly 2 L . This is clear for L = as such a tree must consist of just the root. Now assume 
L > and suppose T is a tree with latency L. Consider the edges that transmit information at 
the last time step L. Clearly these must transmit to the root r. But the root can only receive 
information from one child at a time. Thus there is precisely one edge (r, i) along which information 
is transmitted at time L. Removing the edge (r, i) splits the tree T into two trees T r and T{ rooted 
at r and % respectively. For all the data to be received at r in time L, all the data must be received 
at either r or i by time L — l. Thus both T r and Tj are trees of latency at most L — 1. By induction 
T r and Tj have at most 2 i_1 vertices. Thus T has at most 2 i ~ 1 + 2 L ~ l = 2 L vertices. Conversely, 
given two copies of a rooted tree on 2 i_1 vertices with latency L—l, one can construct a tree with 
latency L on 2 L vertices by joining the roots r, i of these two trees with an edge (r, i), and declaring 
one of the two roots, say r, to be the root of the resulting tree. The transmission protocol to achieve 
latency L is simply to follow the protocols on each tree for the first L—l steps, and then transmit 
all data at i from i to r at time step L. 

As any rooted subtree of a tree T has latency at most Lt, it is clear that the minimum latency 
of any tree on n vertices is L = |~log 2 n \ , and this can be achieved by taking any rooted subtree of 
the tree on 2 L vertices constructed above. □ 
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B Proof of Lower Bound in Theorem [T] 

Note that for v < d, since the MST has energy 0(n), the result follows. For the case v > d, consider 
an arbitrary spanning tree with root r. Consider the path P u from r to u in the tree. Let R(P U ) be 
the length of P u , i.e., the sum of the lengths of the edges of P u . Then with high probability 

^R{P U ) > 5^||«-r|| > cnn 1/d . (18) 

11 u 

for some constant c > 0. Indeed, with high probability, at least one half of the nodes lie at distance 
at least ^n l / d from r. Let n e be the number of paths P u that go through e, so n e is the number of 
vertices below e in the tree. Then 



e 

Indeed, ^ R(P U ) counts the length of e exactly n e times. Now St = X] ^e> so by Holder's inequality 

'EK) 1/ TE< /( "- 1, ) < ""^E^" e £ us) 



Thus it is enough to find an upper bound on ^ rte^ . If e is at distance i from r then the latency 
of the tree from e onwards is at most L* + 5 — i. But this means it has at most 2 L + 5_t < (2n)2 <5_2 
vertices. Hence n e < n2 1+5_i . Also, for each i we have 

n e < n 

e:dist(e, r)=i 

as each vertex can be counted in at most one n e with d(e, r) = i. Thus 

dist(e,r)=i dist(e,r)=i 

for i > 5, and 

dist(e,r)=i 

for i < 5 as we always have n e < n. The first sum is decreasing geometrically in i, so 
^ n ^-i) = (J + 0(l))nn 1 /^- 1 ) =0(1 + 5)W^- 1 ). 

e 

Thus by (fT9|) . 

fV"(l + ^("-iJ/^n > c 'nn l / d . 

Hence 

£ T = ftK /d (l + <5) 1_l/ ) 

as required. 

The proof that 7r AGG in Algorithm [3] provides the correct upper bound is given in Appendix [DJ 
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C Proof of Theorem [2] 



For policy 7r CLQ , the two stages of data forwarding and aggregation do not overlap. Hence, the 
latency and energy consumption under 7r CLQ are sum of the respective quantities in the two stages. 
The aggregation stage uses the 7r AGG policy and we have results from Theorem [TJ We now derive 
latency and energy scaling laws for the data-forwarding stage under 7r CLQ policy. Note that these 
depend on the function dependency graph 9- 

We claim that the latency in the data forwarding stage of 7r CLQ is at most A (9) + 1. This is 

CLQ 

because the forwarding graph is a subgraph of the dependency graph (F w C 9) (with directions 
ignored) since each edge in 9 is traversed at most once, under the clique processor selection procedure 
described in Algorithm HI Note that any graph with maximum degree A has a proper edge coloring 
using at most A + 1 colors (Vizing's theorem) |20] , By scheduling transmissions of a single color 
in each timestep, we can ensure that each node is either transmitting/receiving from at most one 
node. The energy consumption is at most the sum of the power-weighted edges of 9 and hence, 

£ (vr CLQ ; 6, 9) < J2 R e + (vr AGG ; 5, 0). 
ees 

From [21] . for stabilizing graphs 9 (which include /c-NNG and RGG), we have for uniform node sets 
V n , 

lim - Y R u e =C<oo. 
eeS(V n ) 

Hence, IE^eeS R"] = 0(n) and we have the result. 

D Upper bound construction for Theorem 1 

We shall need the following technical lemmas. 

Lemma 2 Let C > 1, d > 1 and v > be constants. Let x be a point of the rectangular paral- 
lelepiped Q = nf=i[0' a «] ^ ^ °f volume n = Y[ a i> an< ^ bounded aspect ratio, ai/aj < C for all 
i, j. Then as n —> oo, 

E( min ||w - x\\ v ) = 6(1), 
wev„ 

where the expectation is over all sets V n of n points in Q chosen independently and uniformly at 
random. 

The implied constants may depend on C , d and v, but not on n. 

Proof: Let r > and set V r to be the volume of the set of points in Q that are within distance 
r of x. Clearly V r is at most the volume of a <i-dimensional sphere of radius r, i.e., V r < cr d for 
some constant c = c{d). For a lower bound we note that V r > 2" d cr d if r is sufficiently small, the 
worst case being when x is at a corner of Q. However, for a general bound valid for larger r we use 
the following observation. If r = diamQ, the diameter of Q, then V r is the volume of Q, which is 
just n. If r < diamQ then, by shrinking Q by a linear factor rj diamQ about x and noting that 
the resulting set Q' lies inside Q n V r , we see that V r > \Q'\ = (r j diam Q) d n. As ai/aj < C for all 
i, j, we have diamQ = 0(n 1 / d ) and hence V r > c'r d for some constant c' = c'(d, C) > 0. 
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Using integration by parts we can write 



/•diam Q 

E( min \\w — xll") = / r^f— 4-P( min llu; — x\\ > r) dr, 



[ — r u F( mm \\w — x\\ > r )j|j iam( 2 



rdiam Q 



ramm y 

+ / (4-r v )¥(mm \\w — x\\ > r) dr 

Jo Kdr ' l »ev„ 11 11 " ' 

/■diam Q 

I vr u ~ l F{ min \\w — x|| > r) dr 

Jo wev n 



as v > and (for n > 0) there is always a point within distance diam Q of x. However, min^ g v„ || w- 
x\\ > r if and only if there is no point of V„ in V r . Thus 



1 - V r < P( min ||to - x|| > r) = (1 - V r /n) n < exp(-V r ). 

t»6V„ 

Hence 

vr v ~ l (l - cr d ) dr < E( min \\w - x\\ u ) < / vr v ~ x exp(-c'r d ) dr, 

where we have restricted the range of integration to a constant for the lower bound (which is less 
than diamQ = S(n 1 / d ) for sufficiently large n), and extended the range on the upper bound. 
However, both bounds are now positive constants independent of n, although they do depend on d, 
v, and C. Thus E(min^ \\w — x^) = 0(1) as required. □ 
Given a set V n C M. d and two points u, v S a V n -path of length k from u to v is a sequence 
ux±X2 ■ ■ ■ Xk-iv with u±, . . . , lifc-i G V n . As before, the energy of a path P = tto«i ... is 

k 

Ep = ^2 ik ~ n i-iir- 

i=l 

Lemma 3 Let u and w be points in Q = [O,?! 1 ^]^ at distance R uw = \\u — w\\ from each other 
and let V n be a set of n points in Q chosen uniformly and independently. Let Ek{u, v; V n ) be the 
minimal energy of a W n -path of length at most k from u to v. Then 

E(E k (u, v; V n )) = 0(m&x{k(R uw /ky,R uw + 1}) . 

Proof: Let X\, ■ ■ ■ , x^-i be A; — 1 subdivision points on the line segment from u to v, equally spaced 
so that this line segment is divided into k equal segments. Pick U{ to be the point of V n , closest to 
Xi, and also write uq = xq = u and u k = x^ = v. Then, after possibly removing loops and repeated 
vertices, uqU\ . . . gives a suitable path from u to v of energy at most ^2i=i \\ u i ~ u «-i|| i/ - Now 

\\ui - Ui-i\\ v < 3 u (±(\\ui - Xi\\ + ||uj_i - + \\xi - Xi-i\\)) u 

< 3^max{||n i - Xi\\", \\ui-i - Xj-ilT, \\xi - Xi_i||"}. 

Hence the energy of the path is at most 0(k(R uw /k) v ) + 0(^2 \\ u i ~ P\\ v )- By Lemma[2l E(^ \\ui — 
Xi\\ v ) = kQ{\). If k < R uw then the expected energy of the path is at most 0(k(R uw /k) u ), as 
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required. If k > £ uw , then we perform the same construction replacing k with to obtain a 

path of expected weight 0(R UW + 1). □ 
Given u, d, 5, our aim is construct a spanning tree T with latency at most L*(n) + 5 and 
small energy Et = Yl ■ The tree will be formed from a minimal latency tree Tl with latency 
L = L*{n) = [log 2 n\ . Some edges will be subdivided and not all vertices will be present. Some 
other minor changes to the tree may also be made. The edges of Tl of level i will be subdivided at 
most 

Si := Lci<y2*( 1 /"-V d) j (20) 

times when v > d, where c\ = 1 — 2 l l u ~ l l d (or c\ = l/[log 2 ra] if v = d) is a normalizing factor 
chosen so that £^ Sj < 5. For v < d we shall take Sj = as subdividing is not necessary. We will of 
course need to prune some leaves later so that the tree has n vertices. It is clear that the latency of 
Tl with level i edges subdivided at most times is at most L* + ^ s, < L*(n) + 5. Indeed we just 
replace the single time step L — i with 1 + Sj time steps, during which data is transmitted along the 
(at most) 1 + Si edges of each subdivided level i edge of Tl- 

Our strategy is as follows. Given a rectangular d dimensional region Q, containing n points, one 
of which is chosen as a root vertex v, we pick a coordinate, say x%, in which Q has largest extent 
and subdivide Q = Qq U Q\ so that half of the points are on each side. 

The root vertex v will be in one half, say Qq. Choose a minimal weight path P from v with at 
most so subdivisions joining utoa vertex v\ in Q\. Now inductively repeat the construction within 
Qq and Q\ using vq = v and v\ as the corresponding root vertices. At the next stage, each of Qq 
and Qi is subdivided as above, say Qq = Qqq U Qq\ and Q\ = Q\q U Q\\. Assume w.l.o.g. that 
£ QoO) ^i £ Qio- Pick a vertex vq\ £ Qq\ that has not already been used and join with a path 
with at most s\ subdivisions to ^oo = inside Qq. Similarly join some v\\ S Qn to v\q = v\ inside 
Q\. Repeat this process. At each stage, choose the v___\ to be any vertex in the other half of the 
appropriate region that has not already been used on any path so far. If no such vertex exists then 
the construction within this half terminates as we have exhausted all the vertices. 

There are two main problems with this approach, both concerning the applicability of induction 
to the process. The n points in Q were chosen to be uniform and independent. If care is not taken 
with the division of Q into Qq and Q\ then the points in Qq and Q\ will not be uniform. For 
example, it is not sufficient to order the points by the xi-coordinate and just take some subdivision 
between the (n/2)th and (n/2 + l)st points. Neither will the volumes of Qq and Q\ be exactly 
n/2, although this is a relatively minor problem as we shall see later. The second, and more serious 
problem is that after constructing the path from vq to v±, some vertices in Qq and Q± have now 
been used, the remaining points are now not independent. This second point will be dealt with 
by allowing the subdividing vertices to be reused in subsequent steps, i.e., by allowing subdividing 
paths to intersect. The resulting graph will then not be a tree, but we shall show that it will still 
have the same latency and can be modified to form a tree with no increase in latency. To solve the 
first problem mentioned above we shall use the following lemma. 

Lemma 4 Take n random points independently and uniformly in [0, 1] and order them by value 
as < Xi < X2 < • • • < x n < 1. Then x r+ i has mean and variance ^+rf™n+2) ' Moreover, 
conditioned on the value of x T +\, the set of values {x\, . . . ,x r } can be obtained by taking r random 
points independently and uniformly distributed in [0, x r +i]. 

Proof: The probability that x r +i lies in a small interval [x, x + dx] is given as the probability 
that some point is in the interval times the probability that there are exactly r of the remaining 
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n — 1 points in [0, x]. Thus the pdf of x r +i is n( n ~ 1 )x r (l — x) n ~ r ~ 1 . This is just the pdf of the beta 
distribution with parameters r + 1 and n — r. The mean and variance can be obtained by using 
standard results on the beta distribution, or by a straightforward calculation. For the conditioning 
result, imagine the points are located in small bins of size dx, chosen so that it is very unlikely that 
two points appear in the same bin. Conditioning on x r +i lying in the bin [x, x + dx] is equivalent to 
conditioning on one point lying in this interval, r points lying in earlier bins, and n — r — 1 points 
lying in later bins. However and valid assignment of points to bins is equally likely, so the conditional 
distribution of the first r points is equivalent to choosing r points uniformly and independently in 

[0,4 □ 

Corollary 1 If r = n/2 + O(l) in Lemma\Q and c = ^ + 0(l/n), then for u > 

E(max{x r+ i/c, 1}") = 1 + 0(l/^/n). 

Proof: Note that x r +i/c is bounded between and 2 + e for large n, so max{x r +i/c, 1} U < 
1 + K\x r +i — c\ for some constant K = K{u) > 0. Now 

E(|a; P+ i - c|) 2 < E(|x r+ i - c| 2 ) = Var(x r+ i) + (E(x r+ i) - c) 2 . 

However, by LemmadJ E(x r +i) — c = 0(l/n) and Var(x r +i) = 0(l/n). The result follows. □ 

Proof of the upper bound for Theorem 1: We follow the strategy outlined above. At step i have 
ni vertices, \n/2 l \ < m < \n/2 l ~\, chosen uniformly at random from a rectangular parallelepiped 
Q = Y\[0,aj] of volume n{ and aspect ratio cij/afc < 3 for all j, k. We wish to construct a spanning 
tree of latency at most Lj = L*{n) — i + ^j>i s j °f expected energy at most M{\ the value of Mj to 
be determined. 

If rii = 1 then there is nothing to do, so assume n« > 1. Pick a coordinate Xj with maximum 
value of aj. W.l.o.g. j = 1. Order the nj points by their xi-coordinates, which are uniform and 
independent in [0, a\\. Let xi(r) be the value of the rth xi-coordinate. Define Qq to be the subset of 
points of Q with xi-coordinate less than xi(ni + \ + 1), where nj + i = [ni/2\ £ [[ra/2 i+1 J, [~ra/2 i+1 ]]. 
Let Q\ to be the subset of points of Q with xi-coordinate more than xi(rij-|_i). Note that Qq and 
Qi overlap, however their intersection contains no points of V n . Also note that (with probability 
1), there are ni + i points in Qq and ni — ni + i = [nj/2] € [[n/2 l+1 J, [~ra/2 i+1 ~|] points in Q\. By 
Lemma HI conditioned on the shape of Qq, say, the points in Qq are uniformly distributed inside 
Qq. (There is however a dependency of these points on the shape of Q\.) If the root lies in Qq 
then we use this as the root of Qq, otherwise we pick any point in V n n Qq as the root. Shrink Qq 
in the xi-direction by a factor Xq = (xi(nj + i + l)/ai)/(nj+i/nj) to obtain a set Q' Q with volume 
exactly rij+i. Now construct a spanning tree of latency at most Li + i on the points in Q' Q . Let Eq 
be the energy of this tree. Now scaling by a factor of Xq we obtain a tree To in Qq of expected 
energy at most max{l, Xq} u Eq, all lengths having increased by a factor of at most max{l, Xq}. 
Note that Xq and Eq are independent, so the expected value of the energy of this tree is at most 
E(max{l, Xq} u )HL(Eq) which is at most (1 + C/y / n7)E(i?o) by Corollary [TJ Since a± was maximal 
and the xi-extent of Q' Q is a\ni + i/ni > a\/2>, the aspect ratios of Q' are again at most 3. The 
points in Qq are distributed uniformly at random and the volume of Q' is rij+i. Thus by reverse 
induction on i, ~E(Eq) < Mi+\. A similar argument also applies to Q\, so we obtain two trees with 
expected total energy of at most (1 + C / ^fni)Mi + i. By Lemma [3] we can find a path from the root 
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■uo of Qo to the root v\ of Q\ of length at most 1 + Sj and expected energy at most 

C'm a x{n u l /d (l + s i ) 1 - u ,n 1 i /d + l} 

Note that this path will reuse vertices of the trees To and T\ , so that we may obtain a graph G with 
cycles. However, the latency of G is still at most L{. Indeed, in the first Lj+i = L*(n) — (i + 1) + 
Ylj>i+i s j steps we transmit the data to vq and v\ in To and T\ separately. Then in the last 1 + Sj 
steps we transmit the data along the vq — v\ path P to whichever of these vertices was the root 
of Q. If one insists on having a tree then one can modify G slightly as follows. Any vertex of To 
or T\ that lies in P holds the data rather than transmitting it on during the first L, l+ i steps. Tree 
edges from these vertices towards the roots vq and v\ are deleted from To and T\ as these edges 
are no longer used. The trees To and T\ now become a forest of possibly many trees. All the data 
after the first Lj + i steps is located at vertices that are roots of all the individual trees in this forest. 
These vertices all lie on P. The path P then joins all the roots of these small trees to form one tree 
and data can be swept up along P in the last 1 + Si time steps. 

It now remains to bound the expected energy of this tree. We have an upper bound given by 
the energies of To, Tt, and P of 

Mi := 2(1 + C/^i)M l+l + C'max{< /d (l + Sl ) l ~ u , n] /d + 1} (21) 

where in order to maximize the expression we take hi = \n/2 l \ and rii = [~n/2*~|. Note Mi = for 
i > log2 n as then n$ < 1. Inductively substituting the bound in (|2ip for Mj+i gives 

Mi < ( J](l + C/y/i%)) Yl C " 2? ~ l max {"j /d ( 1 + s i) 1_y ' n T + !} 

j>i j>i 

Now the hj increase exponentially as i decreases from log 2 n, thus the product nj>i(l + C/y/nj) 
can be bounded independently of n and i. Hence 

log 2 n 

M < C" 2% max{(n/2y/ d (l + Si ) l ~ u ', {n/2 i ) l ' d + 1} 

log 2 n log 2 n 

< C" &(n/tf) v / d (l + Si) 1 "^ + C" 2 i ((n/2 i )V«' + 1) (22) 

i=0 i=0 

where C" absorbs this product and also any factors that arise from the difference between rij and 
n/2 3 . The second sum in (|22p is geometrically increasing for d > 1 so has sum 0(n). Assume v > d. 
Then Si = Q(52 i( - 1 / U ~ 1 /^) and 1 + (1/u - l/d)(l - u) - vjd = (l/v - l/d) < 0. Thus the first sum 
in (|22p is decreasing geometrically when > 0, and also decreasing geometrically at an even faster 
rate when Sj = 0. Hence this sum gives O(n^ d (l + 8 ) 1 - V ). Thus 

M = 0(max{n,n^ d (l + J) 1 ^}). 

for v > d. For i/ = d > 1, Sj = so = 0(5/ log n) is constant. Thus the terms in the first sum are 
equal and add up to (logn)O(n(l + sq) 1 Thus 

Mo = 0(max{n,n(logn)(l + 8/ log n) 1_I/ }). 

for v = d > 1. Finally, for v < d, Sj = and the first sum is increasing geometrically. This sum is 
then 0{n) and so 

M = 0(n) 

for 1 < v < d. □ 
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